Vocabulary Patterns in Free-for-all Collaborative Indexing Systems

نویسندگان

  • Wolfgang Maass
  • Tobias Kowatsch
  • Timo Münster
چکیده

In collaborative indexing systems users generate a big amount of metadata by labelling web-based content. These labels are known as tags and form a shared vocabulary. In order to understand the characteristics of that vocabulary, we study structural patterns of these tags by implying the theory of self-organizing systems. Therefore, we utilize the graph theoretic notion to model the network of tags and their valued connections, which represent frequency rates of co-occurring tags. Empirical data is provided by the free-for-all collaborative indexing systems Delicious, Connotea and CiteULike. First, we measure the frequency distribution of co-occurring tags. Secondly, we correlate these tags towards their rank over time. Results indicate a strong relationship among a few tags as well as a notable persistence of these tags over time. Therefore, we make the educated guess that the observed collaborative indexing systems are self-organizing systems towards a shared vocabulary building. Implications on the results are the presence of semantic domains based on high frequency rates of co-occurring tags, which reflect topics of interest among the user community. When observing those semantic domains over time, that information can be used to provide a historical or trend-setting development of the community’s interests, thus enhancing collaborative indexing systems in general as well as providing a new tool to develop community-based products and services at the same time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tagging, Folksonomy & Co - Renaissance of Manual Indexing?

This paper gives an overview of current trends in manual indexing on the Web. Along with a general rise of user generated content there are more and more tagging systems that allow users to annotate digital resources with tags (keywords) and share their annotations with other users. Tagging is frequently seen in contrast to traditional knowledge organization systems or as something completely n...

متن کامل

0 Ja n 20 07 Tagging , Folksonomy & Co - Renaissance of Manual Indexing ? ∗

This paper gives an overview of current trends in manual indexing on the Web. Along with a general rise of user generated content there are more and more tagging systems that allow users to annotate digital resources with tags (keywords) and share their annotations with other users. Tagging is frequently seen in contrast to traditional knowledge organization systems or as something completely n...

متن کامل

The Impact of Pre-Defined Terms on the Vocabulary of Collaborative Indexing Systems

Collaborative indexing systems have attracted an increasing amount of attention over the last three years. One fundamental limitation to such a system is the uncontrolled nature of its vocabulary, as this consists of terms users freely choose to index resources. As a result, the vocabulary can be poorly structured, making it difficult to harvest knowledge from the user community. Pre-defined te...

متن کامل

Use of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems

  One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...

متن کامل

Indexing and Retrieving Images in a Multilingual World

Introduction This communication presents the problem statement, the methodology and the preliminary results of a research project aiming to compare two different approaches for indexing images, namely: traditional image indexing with the use of controlled vocabularies, or free image indexing using uncontrolled vocabulary. The experiment intends to measure their respective performance for image ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007